# Few-shot Training
Tinyllava Video Coldstart NextQA 16
Apache-2.0
TinyLLaVA-Video-R1 is a video-text-to-text model obtained through cold-start training of TinyLLaVA-Video using 16 manually annotated samples from the NextQA dataset.
Video-to-Text
Transformers

T
Zhang199
63
0
R1 Aqa
Apache-2.0
R1-AQA is an audio question answering model based on Qwen2-Audio-7B-Instruct, optimized through Group Relative Policy Optimization (GRPO) algorithm, achieving state-of-the-art performance in the MMAU benchmark.
Audio-to-Text
Transformers

R
mispeech
791
14
Beit Base Patch16 224 Pt22k Ft22k Finetuned FER2013 7e 05 Finetuned SFEW 7e 05
Apache-2.0
An image classification model based on the BEiT architecture, fine-tuned on the FER2013 dataset for facial expression recognition
Image Classification
Transformers

B
lixiqi
18
0
Sd Class Wikiart From Bedrooms
MIT
This is a diffusion model initialized with Google's DDPM bedroom image model and fine-tuned on the WikiArt dataset for unconditional image generation.
Image Generation
S
johnowhitaker
278
0
Summarizer Cnndm
An English text summarization generator fine-tuned on the BART model, trained on the CNN-DailyMail dataset
Text Generation
Transformers English

S
yuvraj
18
0
Featured Recommended AI Models